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Introduction 


The  Sen  10a  gene  product 
encodes  a  tetrodotoxin-resistant 
sodium  channel  (SNS/PN3) 
expressed  exclusively  in  a  subset  of 
primary  sensory  neurons  (e.g.,  dorsal 
root  and  nodose  ganglia)  believed  to 
be  involved  in  pain  transmission. 

Thus,  it  is  important  to  understand 
mechanisms  contributing  to  both  the 
function  of  the  protein  and  the 
exquisite  specificity  of  gene 
expression.  The  overall  research  plan 
is  detained  in  the  flowchart  depicted 
to  the  right.  Significant  progress  was 
made  during  the  latest  funding  period 
on  both  the  genomic  (left  branch)  and 
proteomic  (right  branch)  sections  of 
the  research  plan  (figure  1). 

Expression  analysis  of  the  4.0  kb  genomic  sequence  immediately  upstream  of  the 
transcriptional  start  site  of  the  Sen  10a  gene  revealed  that  the  1.5  kb  sequence  distal  to  the 
transcriptional  start  site  is  essential  for  gene  expression.  The  transcription  factor  c-Jun 
(API)  was  found  to  bind  to  the  far  upstream  region  of  this  4.0  kb  sequence.  Evidence  of 
additional,  but  as  yet  uncharacterized,  transcription  factor  interaction  in  the  far  upstream 
region  was  detected.  To  isolate  transcription  factors  that  interact  within  sub-regions  of 
the  4.0  kb  region,  we  have  utilized  the  yeast  one-hybrid  technology.  A  number  of 
putative  transcription  factors  that  interact  within  specific  regions  essential  for  Sen  10a 
gene  transcription  have  been  isolated  and  their  identity  is  currently  being  assessed. 

Lastly,  a  wide  range  of  cDNAs  encoding  wild-type  and  mutant  (constitutively  active  or 
dominant  negative)  forms  of  Ga,  Gj3,  and  Gy  subunits  were  constructed  to  assess  their 
role  in  modulating  expression  of  the  Sen  10a  sodium  channel  in  future  co-transfection 
assays  of  mouse  dorsal  root  ganglia  (DRG). 
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Expression  studies  with  4.0kb  region, 


The  sequence  of  the  4.0  kb  region  upstream  of  the  transcriptional  start  site  of  the 
Sen  10a  gene  is  shown  in  figure  2. 

Figure  2 

Sequence  Range:  1  to  4032 

>LMPCR_( round_two ) _product_SSPl_libr ary 

I 

j  10  20  30  40  50  60 

ATTCCAGTTGCTGAGTGGAGAGAGCACTGTAGGGTCATGGAAGGACAGTGGGGAGGTCTG 

70  80  90  100  110  120 

TTAGAGGTCCTTGAAATTATATAGTGACCTCGCCATGATGGTGGTCTCAGAGATCGAGAG 

130  140  150  160  170  180 

ATGATGTAATCAGGAGGACTCTAGGAATTCAAGTTAGAGGCCCCAGAAAGAGGGCTGTGG 

190  200  210  220  230  240 

ACGAGGGACGGCTCTTGGATTACCTCTAGATGCTGGGCTTGTGAGTCCAGGCAAGCAGAG 

250  260  270  280  290  300 

TGTTCTTGGAGAGGCTTCTCTGGGGGAGGATCATTCTGAGCAGGGCACAGGCACAGAAAT 

310  320  330  340  350  360 

CATTAGTCCATCTGTAAACATGTCTGAGATGTTAGTGGAGTGTCCATGAAGGGAAATTCA 

370  380  390  400  410  420 

GGCTTC  TAC  C AC ATT AGTGT ATAT  TT AAATC  TGAC ACC AGG AGAG AGAT  TTATGAT  GGAG 

430  440  450  460  470  480 

CTGACAGACTCCGGTGCCATGTCAGGTAGGTGACTGAAGCCCTGGGGAAGGAGAGGCGTA 

490  500  510  520  530  540 

GGATGGAATCTTAAAACGATTCTCCAACTACTTCCAGGTGGCAGAGGAGGAGGCAGCCCA 

550  560  570  580  590  600 

GGCCAGAGAAGCTCCTCTGAAAACAGAAGTCAAGAGGGTGGAGTGTGGTGCAAGGACCAT 

610  620  630  640  650  660 

GCAGCTAATCCTGCGGAGCCCCTAGGATGAGAGCGCCAGAGAGGAGACACATGACACAGG 

670  680  690  700  710  720 

GAGACCAGTAGAAACCTGTTAAGATTCCGGGTGTCTCAGGACTGCCTCTGGATGCACACT 

730  740  750  760  770  780 

TCTTCCTTCTTGGGAAGTTACTTTTCTGTCACTGTGATGAAATACCTTAACCAAGGTGAC 

790  800  810  820  830  840 

TCAAAGAAGAGAGGGTTTATCTGGGCTCACGGGTCCAGAGGTAGAGGAACACATGGAGAT 

850  860  870  880  890  900 

CGTGGTGGGGAACCAGTGTAGCAGGCAAGCATGGTGGCTGGGGCTGAGGCTGAGAGCTTA 

910  920  930  940  950  960 

TATCTTTGTCTGTATACAGAAAGCAGAGAGAGCCAACTGGGAATGACTTGTGGCTTTTGG 

970  980  990  1000  1010  1020 

AACCTGAAACCTGTCTTCGGTGACATGCTCCCTCCAGCGAAGGCAATGCCTCCTCAAACT 

1030  1040  1050  1060  1070  1080 
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CCCCAAAGGGCACCACAAACTAGGAACCAAGCACTCAGATGCCCGAGACTATGAGCGACA 

1090  1100  1110  1120  1130  1140 

TCTCCTTCAGATCACCACACTTGGGTACACCCATTCTCCTGTCTCATCCAGTTTGCTCTT 

1150  1160  1170  1180  1190  1200 

CTGGAAGGGTGGTGAGAGGGATGACAGCTAGTGACAAGTTGGAGAGACTTTAGAATAATT 

1210  1220  1230  1240  1250  1260 

GCCATCACACAAAGCCTACCCTATCAGTTAGTGGCTGGCACGCTATCCCAACAGCTTGAG 

1270  1280  1290  1300  1310  1320 

TCTGAACTTGCCAGAAATGGCCTCCGTCTCACCTCTCCCAGGCTCCCCAGCACCCACAGG 

1330  1340  1350  1360  1370  1380 

TGCCCTCCCCCAAGACCTGACATCATCGGAGCACTGAAGAGATGCCTCCTCTGCCCCTTT 

1390  1400  1410  1420  1430  1440 

CTCCCCTGGTCTGATTGCTACCAGGCAGCTGATCCACATGCCCTGCTCCAAGTTTGACCC 

1450  1460  1470  1480  1490  1500 

CAGTCAGCAGGCTTCTCTGAAGAAGAGGGTCTGTTAGCATGACACACAGCATTTCCCATG 

>overlap_LMPCR_round_l+2 

1510  1520  1530  1540  1550  1560 

CAACAGAACCTTGGAACCTAGGACAGAACTCAAGATATCAACGTGACACACACATGTACA 

1570  1580  1590  1600  1610  1620 

TGTACCCTTACACACCTGAACGTGCATATACACACGTACACTTGTACACACACTAAAATA 

>LMPCR_( round_one )_product_EcoRV_library 

I 

1630  1640  1650  1660  1670  1680 

ACTGTGGTGCCAGGAGTGATCAGGGTACTGAGGTGACAGTCATTCACAAGGCTTGCCTTT 

1690  1700  1710  1720  1730  1740 

CTCCACGTGACACACTTTGACCTTACCACCTGTTTCCACCTTGCTCTGGCATTTTAAAAA 

1750  1760  1770  1780  1790  1800 

CATGACATTTTTAGTAAATTCTTTGAATTTTTTTGAGACAGGGTCTTACACTACAGCTCA 

1810  1820  1830  1840  1850  1860 

AGCTGATCTTGGGTTTGCAGCAATCCTCTTCCCTCGACCACCCCCCCAACTAGGATGTGA 

1870  1880  1890  1900  1910  1920 

GCTGCCATGCCCAGTTTGACTCTTTCCCAGATGTTTGTTTTTATTCTGTATGTATGAGTG 

1930  1940  1950  1960  1970  1980 

TTCTCACTGTATGTATGTCTGTATATGCACCATGTAGAGCCCAAAAGCAGTTGCTGAATG 

1990  2000  2010  2020  2030  2040 

CTCCGGAGCTTGGAGCTGTGGGTGGCTGTGAGCTGCCACGTGGGGTGCCAAGAATAGAAC 

2050  2060  2070  2080  2090  2100 

ACAGGTCCTCTGCAAGAGCATCGAATGCTCTTAACCACTGAGCTATCCCTCCAGGTATTA 

2110  2120  2130  2140  2150  2160 

AAACATAAAATAAGCTTGACTTTTTACTCTGACATAATTTAAAATTCACAAAAAGTTTTA 

2170  2180  2190  2200  2210  2220 

AAAAAACGTAACAGCTTCCGTATACTCCCAATCCCATTTCCCCAGTTAGGATATTCTTTA 

2230  2240  2250  2260  2270  2280 

ACCATAGTACATTGTCAAATGAGAAAACTAACATTCATACAACACGACTGATTTTGGTGA 
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2290  2300  2310  2320  2330  2340 

AAATCCTATTTGGAGTACACTACCTTTGACTGTGATTTCTTTTTGCACTATGGCCCAGGC 

2350  2360  2370  2380  2390  2400 

TAGCCTGCAACTATTTATCATTATTTATCCCAGGGTGACTTGAACTTAGGGCAATTCTTC 

2410  2420  2430  2440  2450  2460 

TACCTCAGCTCTGCCCCCACTCCCACCTCAACTCCAGTCCTGGGGTTACAGGAGCAAGCC 

2470  2480  2490  2500  2510  2520 

ATCAAGTTCTATAACATTTAATACACAAGGACACTGGTTAAACTCAGAAGGACCTAAATT 

2530  2540  2550  2560  2570  2580 

AGCATAAGACTATGGGGACCAGAGAAGTGAGAAGTGAGGACAGGGGAGGAGGGCAGGGGA 

2590  2600  2610  2620  2630  2640 

GGGAAGATGGGAGGAATGATGGGAAGAGAATGAGAGAAGGCAGGGAGGGAGAGGAGAAGG 

2650  2660  2670  2680  2690  2700 

CCAGTGAAGGGAGAATGGGAAGGGAGGGAGTTGAGAGAAGGCAGGATCGGGAGCCATAGA 

2710  2720  2730  2740  2750  2760 

ATGTCTGTAGGAAACCATCAAAGGCATTTAATTTAATAAAGCAACCAGGATTGTACATAA 

2770  2780  2790  2800  2810  2820 

TTCTACTGTGTACATACAAACACTCAAGTTTTGGGAGCAAGAATTTTAGCTTTCCTTCCC 

2830  2840  2850  2860  2870  2880 

CTGCCCCTTTATGATTCAGTCTCTGCTAGAAAAAGTGGAGCCTTGCAGGGTGTGGTGGTG 

2890  2900  2910  2920  2930  2940 

CACGCCTTTAATTCCAGCGTTTGGGAGGCAGAGGCAGGTGGATTTCTGTGAGTTCCCGGT 

2950  2960  2970  2980  2990  3000 

CAACCAAATCTCCATAGTATGATCCTTCGTGGAATACCGGCCAACCAAACAAGCAAACAA 

3010  3020  3030  3040  3050  3060 

ACAAACAAAAATCCCAAACAAACCCCACCCCCACCCAAATAGAGGGGATTATTGACTCAA 

3070  3080  3090  3100  3110  3120 

AGAAGCCAATAATTTTGAGTTGGTTTGGGACATTTGAGTAAATGAAGCTGTAATGGGCAA 

3130  3140  3150  3160  3170  3180 

GCATGGGCCCTCGACAGTTTCCTGCAGTATAGCATGGCTTCCTAAGGCTGCGTGGGTTGC 

3190  3200  3210  3220  3230  3240 

ACTGTTACGGAGGGCTCAGCTCAGACAGGGGGTTCCCTGTGCAACCTCCTTTCTTATGGT 

3250  3260  3270  3280  3290  3300 

CCCACAACCCCACAGATAGGGCACTTTCCCTACCCAGCTCCCTTCTCGGCTCTCACTGGG 

3310  3320  3330  3340  3350  3360 

GTCGGAGAACATTTTTGTTTCAGCATTTCATCTGAAGCCACGGTTTCACATCATCAAGTC 

3370  3380  3390  3400  3410  3420 

TGCAAAAAACCGTTCACAAACCACACCAGAACTTCTCGGTAAAGAACTCCTAAGACCAAA 

3430  3440  3450  3460  3470  3480 

GAGGGAGACTGGGTAGATTGTTTTTAATTTGTTTCTTTTTGTCAAAGGGGGACAAAACAC 

3490  3500  3510  3520  3530  3540 

GCTTTGGTGAGTGCGAGTGTTTATTCTGGGACACAAACCCAGAGTCTGGAAGGGAGCATT 

3550  3560  3570  3580  3590  3600 
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CAACGGGTGCTGCTCTGCCACGCAGGGGCAGCGGTGGGACTCAGCCCATCCTGCTAAGGA 


3610  3620  3630  3640  3650  3660 

CGGGCAGCCTGAGCCAGGCTTGGGAGTCTGTCATGGCTGCCAGACGAATCATTATCTAAT 

3670  3680  3690  3700  3710  3720 

TGCAGCCTTTTCTCTTCCTTAGGTTTCAGCAGGTCCCGAGAGAGCATTTAAAATCGCATT 

3730  3740  3750  3760  3770  3780 

TACTACTTTACCATCTAATCACACATAAGCCTCTCCCTATACCCTCCACCCTCCTTCCAT 

3790  3800  3810  3820  3830  3840 

TCAGAGTGTACTTTCTGGAGCCCATCCAGCAAGCAGGGTGGAACTCATGACGGGAAATGG 


3850 


3860 


3870 


3880 


3890 


3900 


GAACGGCGCCCACGAAGGCGTGATTCCTTGTAGATCCTTGAGTGATGGACGGGTGAGGTT 

>RACE__clone_B_end 

3910  3920  3930  3940  3950  3960 

TCCGTCAGGCAAGCCCAGCCACCTTCGTGGAGGAGCCCCGGACAAGTGTAAGTTTCGCAG 

>RACE_clone_I_end 

I 

3970  3980  3990  4000  4010  4020 

AGCTGGGGTCTCCAGCTTACTTCTGCTAATGCTACCCCAGGCCTTTAGACGGAGAACAGA 


4030 

TGGCAGATGGAG 


Two  PCR  products,  1.7  kb  and  2.5  kb  in  size,  corresponding  to  the  transcriptional 
start  site  distal  and  proximal  portions  of  the  4.0  kb  sequence  were  cloned  into  the  pEGFP- 
1  vector  from  Clontech  (figure  2).  This  vector  contains  the  coding  region  of  the  enhanced 
green  fluorescent  protein  down  stream  from  a  multiple  cloning  site.  The  vector  allows  the 
analysis  of  sequences  for  promoter  activity  by  their  ability  to  drive  expression  of  the 
EGFP  protein  product. 


Figure  2.  pEGFP  expression  vector. 
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The  resulting  expression  constructs  were  microinjected  into  the  nuclei  of  neurons 
from  primary  cultures  of  dorsal  root  ganglia.  A  nuclear  targeted  dsRED  construct  was 
coinjected  as  a  positive  control.  The  presence  of  visibly  red  nuclei  indicated  a  successful 
injection  yet  would  not  interfere  with  the  detection  of  the  EGFP  signal  that  was 
predominantly  cytoplasmic.  The  neurons  were  dissociated  with  collagenase  and  trypsin 
and  cultured  for  two  days  in  the  presence  of  nerve  growth  factor  and  glial  derived 
neurotrophic  factor.  The  construct  containing  the  2.5kb  fragment  failed  to  produce  visible 
EGFP  production  as  shown  in  figure  3.  The  4.0kb  fragment  successfully  drove  expression 
in  a  majority  of  but  not  all  injected  cells. 


Figure  3.  The  2.5  kb  fragment  does  not  drive  expression  of  EGFP  in  mouse  DRG 
neurons. 
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Figure  4  shows  cells  expressing  EGFP  following  injection.  Figure  5  shows  an  example  of 
a  successful  injection,  as  viewed  by  dsRED  production,  with  no  EGFP  production.  The 
expression  of  Sen  10a  in  only  a  subset  of  small  diameter  neurons  in  DRGs  may  account 
for  the  failure  of  this  construct  to  express  in  all  injected  cells.  Injection  of  all  constructs 
into  sympathetic  neurons  isolated  from  superior  cervical  ganglia  failed  to  produce 
visually  detectable  levels  of  EGFP.  Sen  10a  is  not  expressed  in  these  neurons  and 
therefore  this  experiment  serves  as  a  negative  control. 

Figure  4.  The  4.0  kb  construct  drives  expression  of  EGFP  in  DRGs 


-3938  Pr 

LM-PCR  round  1  -2  (4032bp) 

“  LM-PCR  round  1 

LM-PCR  round  2  2.5kb 

1.7kb 


Green  Red  Merge 

Mouse  DRG  in  culture  2  days  (NGF+GDNF) 

0. 1  //g///l  4kb  +  and  0.005  /rg//d  dsRed2-nuc 


Deletion  analyses  of  the  4.0kb  fragment  has  begun.  Three  deletion  fragments  of 
the  5’  end  of  the  4.0kb  fragment,  designated  S,  M,  and  L,  have  been  generated  by  the 
PCR  and  cloned  into  pEGFP-1 .  These  constructs  designated  S,  M,  and  L  were  generated 
by  designing  primers  to  various  positions  of  the  parent  4.0kb  fragment  as  shown  in  figure 
5.  Preliminary  injection  experiments  performed  as  described  above  with  the  M  or 
medium  sized  construct  produced  visibly  green  cells.  This  suggests  that  essential  cis 
elements  lie  within  the  region  in  construct  M.  Sequence  analysis  of  the  4.0  kb  region 
(figure  6  and  7)  reveals  a  number  of  putative  cis  elements  and  silencer  elements  that  bind 
specific  transcription  factors. 
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Figure  5.  5’  Deletion  strategy  for  4.0kb  deletion  constructs. 


6500  1300.  1950.  2600.  3250, 

3900^  *  4550,  J>200(  5 

1 

1  4.0kb  ScnIOa  promoter  ► 

|  4.0/500-525Xhol(L)  ► 

|  4.0/857-880Xhol(M)  ► 

|  4.0/1 238-1 2 59Xhol(S)  ► 

|  anchor  primer  3’end  4 

Figure  6.  Putative  cis  elements  in  the  4.0  kb  region. 


•Many  putative  T.F.  binding  sites  (TRANSFAC) 

•No  TATA  or  CAAT  (expect  multiple  start  sites) 

•Focus  on  relevant  sites  found  with  high  stringency 
•Many  MyoD  and  GATA  (muscle/heart/erythroid  specific?) 


-3938 


API 

-3489 


API 

-3195 


API 

-3163 


N-myc 

-2257 

AP4  **RN2  N-myc 

-2534  ~2451  -2252  -1924 


OCT-1 


-1421 


API 

-886  +1 


Ur 


Figure7.  Location  of  putativesilencer  elements  NRSE  (Neuron  restrictive  silencer 
element)  and  DNRE  (Dopamine  beta-hydroxylase  neuron  restrictive  element). 


Weak  NRSE  or  DNRE  homology 
Neuron  Restrictive  Silencing  Element 
(M4,  Nav1.2,  Synapsin) 

Dopamine  (5-hydroxylase  Neuron  Restrictive  Element 
(rat  and  human) 


DNRE  Putative  NRSE(DNRE)  cluster  NRSE 

-3390  -2466to-1364(8)  -704 

-3938  |  I  I  I  I  I  I  I  I  I 
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Electrophoretic  mobility  shift  analysis  (EMSA)  of  the  4.0  kb  region. 

Since  the  2.5  kb  fragment  was  not  able  to  drive  expression  of  EGFP  in  transfected 
DRGs  but  the  4.0  kb  region  could  not,  this  indicates  that  the  1.5  kb  region  distal  to  the 
transcriptional  start  site  contained  essential  cis  elements.  Therefore,  the  focus  of  this 
granting  period  was  on  identifying  essential  cis  elements  in  the  1.5  kb  region.  The  1.5  kb 
region  was  divided  into  100  bp  sections  (15  total)  by  the  PCR  and  each  100  bp  fragment 
was  labeled  with  [y32P]ATP  and  incubated  with  nuclear  extract  protein  from  DRGs. 

Three  regions,  -3100  to  -3200,  -3300  to  -3400,  and  -3400  to  -3500,  were  able  to  bind 
one  or  more  proteins  present  in  the  DRG  nuclear  extract  (figure  8).  Analysis  of  the 
sequences  from  these  regions  (figures  6  and  7)  indicates  the  presence  of  putative  binding 
sites  for  the  API  protein,  c-Jun,  and  a  neuron  restrictive  silencer  element  (NRSE).  When 
purifed  c-Jun  was  incubated  with  each  sub-region,  binding  of  c-Jun  to  the  -3 100  to  -3200 
was  evident  (figure  8,  lane  4).  Therefore,  one  of  the  two  DNA/protein  complexes 
visualized  when  the  -3 100  to  -3200  region  was  incubated  with  DRG  nuclear  extract 
protein  contained  c-Jun  protein.  The  identities  of  the  other  nuclear  extract  proteins  bound 
to  the  three  regions  are  presently  unknown. 

Figure  8.  EMSA  of  specific  sub-regions  of  the  4.0  kb  region. 
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One-hybrid  analysis  to  isolate  transcription  factors  bound  to  the  4.0  kb  region. 

The  yeast  one-hybrid  technique  is  currently  in  use  to  identify  transcription  factors 
from  DRGs  that  bind  to  the  three  sub-regions  of  the  4.0  kb  region.  The  general  scheme 
for  one-hybrid  screening  is  shown  in  figure  9. 
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Figure  9.  One-hybrid  analysis  of  the  -3100  to  -3200  region. 
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Transcription  factors  contain  at  least  two  domains,  a  binding  domain  (BD)  and  an 
activation  domain  (AD).  The  BD  is  used  to  bind  specific  DNA  sequences  within  the 
promoter  for  a  gene,  whereas  the  AD  is  needed  for  the  transcription  factor  to  activate 
gene  transcription.  The  one-hybrid  analysis  uses  the  100  bp  sub-regions  from  the  4.0  kb 
region  as  a  target  for  any  protein  containing  a  BD  for  a  specific  DNA  sequence  on  each 
sub-region.  The  AD  is  supplied  by  the  Gal4  transcription  factor.  A  cDNA  library  was 
constructed  from  total  RNA  isolated  from  mouse  DRGs  and  ligated  into  the  vector 
pACT2  (Clontech)  that  contains  the  AD  for  Gal4  upstream  of  a  multiple  cloning  site  for 
the  DRG  cDNAs.  Target  reporter  genes  containing  each  100  bp  sub-region  fused 
upstream  of  the  yeast  gene  HJS3  were  constructed  in  the  vector  pHISi-1  (Clontech). 
When  the  DRG-pACT2  library  is  transformed  into  yeast  cells,  the  DRG  cDNAs  are 
expressed  as  fusion  proteins  to  the  Gal4-AD.  If  the  DRG-pACT2  library  is  transformed 
into  a  yeast  strain  containing  an  integrated  copy  of  each  100  bp-pHISi-1  vector,  any 
fusion  protein  containing  a  BD  for  specific  DNA  sequences  in  the  100  bp  sub-regions  is 
expected  to  bind  within  the  100  bp  region  and  activate  gene  transcription  of  the  H1S3 
gene.  Growth  of  a  histidine-requiring  yeast  strain  containing  an  integrated  copy  of  a  100 
bp-pHISi-1  vector  on  media  lacking  histidine  indicates  that  a  fusion  protein  capable  of 
binding  to  the  100  bp  sub-region  is  present  in  the  transformed  strain.  Using  this 
approach,  we  identified  42  DRG-pACT2  clones  that  contain  putative  BDs  for  DRG 
transcription  factors  that  bind  to  the  -3100  to  -3200  sub-region  of  the  Sen  10a  promoter. 
We  currently  are  analyzing  these  clones  by  DNA  sequence  analysis  to  determine  whether 


they  correspond  to  known  transcription  factors  or  are  novel.  As  time  permits,  the  other 
two  sub-regions  will  be  analyzed  in  the  same  fashion. 

Cloning  wild-type  and  mutant  Ga,  G0,  and  Gy  subunits  for  future  analysis  of  their 
effect  on  ScnlOa  function. 

The  purpose  of  these  experiments  is  to  determine  whether  G  protein  a  and/or  |3y 
subunits  modulate  the  ScnlOa  sodium  channel  in  sensory  neurons.  Various  combinations 
of  G  proteins  will  be  co-expressed  in  mouse  DRG  or  nodose  ganglion  neurons,  and 
whole-cell  voltage-clamp  recording  of  tetrodotoxin-resistant  sodium  channel  activity  will 
be  made  using  the  patch-clamp  technique.  During  this  funding  cycle,  a  large  number  of 
wild-type,  constitutively  active,  and  dominant  negative  forms  of  Ga,  G|3,  and  Gy  genes 
have  been  isolated  by  our  Guthrie  cDNA  Resource  Center  staff  (see  website 
www.cdna.orgl.  The  number  of  clones  has  expanded  greatly  since  the  start  of  this 
proposal.  The  cDNAs  were  prepared  by  the  PCR  using  DNA  primers  specific  to  known 
G  proteins  and  subcloned  into  two  mammalian  expression  vectors,  pcDNA  3.1 
(InVitrogen)  and  PDNR-lr  (Clontech).  The  clones  were  sequence-verified,  and 
expression  verified  in  most  cases  by  coupled  in  vitro  transcription/translation  assays  and 
the  catalog  of  clones  is  shown  (appendix  A). 

Cloning  of  the  wild-type  G-protein  aolf  subunit  (appendix  B)  and  its 
constitutively  active  form  (Appendix  C)  is  given  as  an  example  of  the  clones  isolated  by 
the  Guthrie  cDNA  Resource  Center.  The  complete  coding  sequence  for  wild-type  aolf 
and  the  location  of  the  mutation  introduced  to  change  a  glutamine  (Q)  to  leucine  (L)  to 
eliminate  GTPase  activity  yielding  a  constitutively  active  phenotype  is  indicated. 
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Key  Research  Accomplishments 


1 .  Analysis  of  the  4.0  kb  genomic  sequence  immediately  upstream  of  the 
transcriptional  start  site  of  the  Sen  10a  gene  revealed  that  the  distal  1 .5  kb  portion 
was  essential  for  gene  activation  in  DRGs. 

2.  The  transcription  factor  c-Jun  was  shown  to  bind  in  vitro  within  the  -3 100  to 
-3200  region  contained  on  this  4.0  kb  fragment. 

3.  At  least  five  other  transcription  factors  bind  within  the  region  -3 100  to  -3500, 
and  their  identities  are  as  yet  unknown. 

4.  A  large  collection  of  cDNAs  containing  binding  domains  for  putative 
transcription  factors  that  interact  within  the  -3 100  to  -3200  region  were  identified 
by  a  yeast  one-hybrid  protocol. 

5.  A  large  collection  of  cDNAs  encoding  wild-type  and  mutant  forms  of  Ga,  G|3, 
and  Gy  subunits  were  constructed  for  future  analysis  into  their  role  in  activating 
the  Sen  10a  tetrodotoxin-resistant  sodium  channel. 


Reportable  Outcomes 

None 


Conclusions 

The  focus  of  this  funding  period  has  been  on  the  4.0  kb  genomic  fragment 
immediately  upstream  of  the  transcriptional  start  site  for  the  ScnlOa  gene.  The  distal  1.5 
kb  portion  was  shown  to  be  essential  for  ScnlOa  gene  expression  in  transfected  DRGs. 
Because  of  the  relatively  large  size  of  this  region,  we  sub-divided  it  into  100  bp  sections 
and  analyzed  these  regions  by  EMSA  for  binding  of  DRG  nuclear  extract  protein  and 
found  that  the  -3100  to  -3500  region  efficiently  bound  several  proteins  in  vitro.  The 
DNA  sequence  of  this  region  showed  the  presence  of  API  (c-Jun)  binding  sites  that  was 
confirmed  by  EMSA  with  purified  c-Jun  protein. 

To  date,  the  -3100  to  -3200  region  has  been  analyzed  for  transcription  factor 
binding  sites  using  the  yeast  one-hybrid  assay.  We  have  isolated  42  cDNA  clones  that 
contain  at  least  the  binding  domains  for  putative  transcription  factors  that  interact  within 
this  region.  Analysis  of  these  cDNAs  is  in  progress  including  isolation  of  their  full- 
length  coding  sequence.  This  will  allow  us  to  determine  whether  any  of  these  cDNAs  are 
functional  in  co-transfection  analyses  along  with  ScnlOa  promoter-EGFP  reporter 
constructs  into  primary  cultures  of  DRGs.  The  -3200  to  -3500  region  is  to  be  included 
in  the  focus  of  the  next  funding  cycle. 

We  have  cloned  a  large  number  of  cDNAs,  both  wild-type  and  mutant  forms,  for 
Ga,  G(3,  and  Gy  subunits  that  are  expected  to  be  useful  in  future  experiments  to 
determine  their  role  in  regulating  expression  of  the  tetrodotoxin-resistant  ScnlOa  sodium 
channel. 
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Notes  The  Q214L  mutation  was  introduced  into  the  human  G-protein  alpha  olf  (GNA0L00000)  via 
the  Quickchange  mutagenesis  kit  (Stratagene).  The  mutation  reduces  GTPase  activity 
resulting  in  a  constitutively  active  phenotype.  Insert  size  =  1150  bp. 
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GCTAGCGTTTAAACTTAAGCTTGGTACCACC  ATG  GGG  TGT  TTG  GGC  GGC  AAC 

M  G  C  L  G  G  N> 
HUMAN  G-PROTEIN  ALPH _ > 


60  70  80  90 

AGC  AAG  ACG  ACG  GAA  GAC  CAG  GGC  GTC  GAT  GAA  AAA  GAA  CGA  CGC 
SKTTEDQGVDEKERR> 
_ a _ a _ a  HUMAN  G-PROTEIN  ALPHA  OLF  Q214L _ a _ a _ a _ > 


100  110  120  130  140 

GAG  GCC  AAC  AAA  AAG  ATC  GAG  AAG  CAG  TTG  CAG  AAA  GAG  CGC  CTG 
EANKKIEKQLQKERL> 
_ a _ a _ a_HUMAN  G-PROTEIN  ALPHA  OLF  Q214L _ a _ a _ a _ > 

150  160  170  180 

GCT  TAC  AAG  GCT  ACC  CAC  CGC  CTG  CTG  CTC  CTG  GGG  GCT  GGT  GAG 
AYKATHRLLLLGAGE> 
_ a _ a _ a_HUMAN  G-PROTEIN  ALPHA  OLF  Q214L _ a _ a _ a _ > 


190  200  210  220  230 

TCT  GGG  AAA  AGC  ACT  ATC  GTC  AAA  CAG  ATG  AG G  ATC  CTG  CAC  GTC 
SGKSTIVKQMRILHV> 
_ a _ a _ a_HUMAN  G-PROTEIN  ALPHA  OLF  Q214L _ a _ a _ a _ > 

240  250  260  270 

AAT  GGG  TTT  AAT  CCC  GAG  GAA  AAG  AAA  CAG  AAA  ATT  CTG  GAC  ATC 
NGFNPEEKKQKILDI> 
_ a _ a _ a_HUMAN  G-PROTEIN  ALPHA  OLF  Q214L _ a _ a _ a _ > 

280  290  300  310  320 

CGG  AAA  AAT  GTT  AAA  GAT  GCT  ATC  GTG  ACA  ATT  GTT  TCA  GCA  ATG 
RKNVKDAIVTIVSAM> 
_ a _ a _ aJHUMAN  G-PROTEIN  ALPHA  OLF  Q214L _ a _ a _ a _ > 


330  340  350  360 

AGT  ACT  ATA  ATA  CCT  CCA  GTT  CCG  CTG  GCC  AAC  CCT  GAA  AAC  CAA 
STIIPPVPLANPEN  Q> 
_ a _ a _ a_HUMAN  G-PROTEIN  ALPHA  OLF  Q214L _ a _ a _ a _ > 

370  380  390  400  410 

TTT  CGA  TCA  GAC  TAC  ATC  AAG  AGC  ATA  GCC  CCT  ATC  ACT  GAC  TTT 
FRSDYIKS  IAPITDF> 
_ a _ a _ a_HUMAN  G-PROTEIN  ALPHA  OLF  Q214L _ a _ a _ a _ > 


420  430  440  450 

GAA  TAT  TCC  CAG  GAA  TTC  TTT  GAC  CAT  GTG  AAA  AAA  CTT  TGG  GAC 
EYSQEFFDHVKKLWD> 


_ a _ a _ a_HUMAN  G-PROTEIN  ALPHA  OLF  Q214L _ a _ a _ a _ > 

460  470  480  490  500 

GAT  GAA  GGC  GTG  AAG  GCA  TGC  TTT  GAG  AGA  TCC  AAC  GAA  TAC  CAG 
DEGVKACFERSNEYQ> 
_ a _ a _ a_HUMAN  G-PROTEIN  ALPHA  OLF  Q214L _ a _ a _ a _ > 

510  520  530  540 

CTG  ATT  GAC  TGT  GCA  CAA  TAC  TTC  CTG  GAA  AGA  ATC  GAC  AGC  GTC 
LIDCAQYFLERIDSV> 
_ a _ a _ a_HUMAN  G-PROTEIN  ALPHA  OLF  Q214L _ a  a  a _ > 

550  560  570  580  590 

AGC  TTG  GTT  GAC  TAC  ACA  CCC  ACA  GAC  CAG  GAC  CTC  CTC  AGA  TGC 
SLVDYTPTDQDLLRO 
_ a _ a _ a_HUMAN  G-PROTEIN  ALPHA  OLF  Q214L _ a _ a _ a _ > 

600  610  620  630 

AGA  GTT  CTG  ACA  TCT  GGG  ATT  TTT  GAG  ACA  CGA  TTC  CAA  GTG  GAC 
RVLTSGIFETRFQVD> 
_ a _ a _ a_HUMAN  G-PROTEIN  ALPHA  OLF  Q214L _ a _ a _ a _ > 

>Q2 14L 

I 

640  650  660  670  |  680 

AAA  GTA  AAC  TTC  CAC  ATG  TTT  GAT  GTT  GGT  GGC  CTG  AGG  GAT  GAG 
KVNFHMFDVGGLRDE> 
_ a _ a _ a_HUMAN  G-PROTEIN  ALPHA  OLF  Q214L _ a _ a _ a _ > 

690  700  710  720 

AGG  AGA  AAA  TGG  ATC  CAG  TGC  TTT  AAC  GAT  GTC  ACA  GCT  ATC  ATT 
RRKWIQCFNDVTAI  I> 
_ a _ a _ a_HUMAN  G-PROTEIN  ALPHA  OLF  Q214L _ a _ a _ a _ > 

730  740  750  760  770 

TAC  GTC  GCA  GCC  TGC  AGT  AGC  TAC  AAC  ATG  GTG  ATT  CGA  GAA  GAT 
YVAACSSYNMVIRED> 
_ a _ a _ a_HUMAN  G-PROTEIN  ALPHA  OLF  Q214L _ a _ a _ a _ > 

780  790  800  810 

AAC  AAC  ACC  AAC  AGG  CTG  AGA  GAG  TCC  CTG  GAT  CTT  TTT  GAA  AGC 
NNTNRLRESLDLFES> 
_ a _ a _ a_HUMAN  G-PROTEIN  ALPHA  OLF  Q214L _ a _ a _ a _ > 

820  830  840  850  860 

ATC  TGG  AAC  AAC  AGG  TGG  TTA  CGG  ACC  ATT  TCT  ATC  ATC  TTG  TTC 
IWNNRWLRTIS  I  IL  F> 
_ a _ a _ a_HUMAN  G-PROTEIN  ALPHA  OLF  Q214L _ a _ a _ a _ > 

870  880  890  900 

TTG  AAC  AAA  CAA  GAT  ATG  CTG  GCA  GAA  AAA  GTC  TTG  GCA  GGG  AAA 
LNKQDMLAEKVLAGK> 
_ a _ a _ a__HUMAN  G-PROTEIN  ALPHA  OLF  Q214L _ a _ a _ a _ > 

910  920  930  940  950 

TCA  AAA  ATT  GAA  GAC  TAT  TTC  CCA  GAA  TAT  GCA  AAT  TAT  ACT  GTT 


SKIEDYFPEYANYTV> 

_ a _ a _ a_HUMAN  G-PROTEIN  ALPHA  OLF  Q214L _ a _ a _ a _ > 

960  970  980  990 

CCT  GAA  GAC  GCA  ACA  CCA  GAT  GCA  GGA  GAA  GAT  CCC  AAA  GTT  ACA 
PEDATPDAGEDPKVT> 

_ a _ a _ a_HUMAN  G-PROTEIN  ALPHA  OLF  Q214L _ a _ a _ a _ > 

1000  1010  1020  1030  1040 

AGA  GCC  AAG  TTC  TTT  ATC  CGG  GAC  CTG  TTT  TTG  AGG  ATC  AGC  ACG 
RAKFF  IRDLFLRI  ST> 

_ a _ a _ a_HUMAN  G-PROTEIN  ALPHA  OLF  Q214L _ a _ a _ a _ > 

1050  1060  1070  1080 

GCC  ACC  GGT  GAC  GGC  AAA  CAT  TAC  TGC  TAC  CCG  CAC  TTC  ACC  TGC 
ATGDGKHYCYPHFTO 
_ a _ a _ a_HUMAN  G-PROTEIN  ALPHA  OLF  Q214L _ a _ a _ a _ > 

1090  1100  1110  1120  1130 

GCC  GTG  GAC  ACA  GAG  AAC  ATC  CGC  AGG  GTG  TTC  AAC  GAC  TGC  CGC 
AVDTENIRRVFNDC  R> 

_ a _ a _ a_HUMAN  G-PROTEIN  ALPHA  OLF  Q214L _ a _ a _ a _ > 

>XhoI 
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1140  1150  1160  1170  | 

GAC  ATC  ATC  CAG  CGG  ATG  CAC  CTC  AAG  CAG  TAT  GAG  CTC  TTG  TGA  C 
DIIQRMHLKQYELL*> 

_ a _ a _ a_HUMAN  G-PROTEIN  ALPHA  OLF  Q214L _ a _ a _ a _ > 
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TCGAGTCTAGAGGGCCCGTTTA 


